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Abstract 

03 ■ We consider a multiple-input-multiple-output radar system and 

^, , derive a theoretical framework for the recoverability of targets in 

the azimuth-range domain and the azimuth-range-Doppler domain via 
sparse approximation algorithms. Using tools developed in the area 
i^) of compressive sensing, we prove bounds on the number of detectable 

^^ \ targets and the achievable resolution in the presence of additive noise. 

Our theoretical findings are validated by numerical simulations. 
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1 Introduction 

While radar systems have been in use for many decades, radar is far from 
being a 'solved problem'. Indeed, exciting new developments in radar pose 
great challenges both to engineers and mathematicians [6]. Two such devel- 
opments are the advent of MIMO (multi-input multi-output) radar [10], and 
the application of compressed sensing to radar signal processing j!5j . 

MIMO radar is characterized by using multiple antennas to simultane- 
ously transmit diverse, usually orthogonal, waveforms in addition to using 
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multiple antennas to receive the reflected signals. MIMO radar has the poten- 
tial for enhancing spatial resolution and improving interference and jamming 
suppression. The ability of MIMO radar to shape the transmit beam post 
facto allows for adapting the transmission based on the received data in a 
way which is not possible in non-MIMO radar. 

A radar system illuminates a given area and attempts to detect and de- 
termine the location of objects of interest in its field of view, and to estimate 
their strength (radar reflectivity). The space of interest may be divided into 
range-azimuth (distance and direction) cells, or range-Doppler-azimuth (dis- 
tance, direction and speed) cells in the case there is relative motion between 
the radar and the object. In many cases the radar scene is sparse in the 
sense that only a small fraction (often a very small fraction) of the cells is 
occupied by the objects of interest. 

Conventional radar processing does not take into account the a-priori 
knowledge that the radar scene is sparse. Recent works, such as [151 IZI] 
developed techniques which attempt to exploit this sparsity using tools from 
the area of compressed sensing [U [8]. The exploitation of sparsity has the po- 
tential to improve the performance of radar systems under certain conditions 
and is therefore of considerable practical interest. 

In this paper we study the issue of sparsity in the specific context of 
a MIMO radar system employing multiple antennas at the transmitter the 
receiver, where the two arrays are co-located. We note that related work 
on the application of compressive sensing techniques to MIMO radar can 
be found in [301 EI]- Our emphasis here is on developing the basic theory 
needed to apply sparse recovery techniques for the detection of the locations 
and reflectivities of targets for MIMO radar. 

The basic model for the problem we are considering involves a linear mea- 
surement equation y = Ax+w where y is a vector of measurements collected 
by the receiver antennas over an observation interval, A is a measurement 
matrix whose columns correspond to the signal received from a single unit- 
strength scatterer at a particular range-azimuth (or range-azimuth-Doppler) 
cell, x is a vector whose elements represent the complex amplitudes of the 
scatterers, and w is a noise vector. The measurement equation is assumed to 
be under-determined, possibly highly under-determined. The sparsity of the 
radar scene is introduced by assuming that only K elements of the vector 
x are non-zero, where K is much smaller than the dimension of the vector. 
The measurement matrix A embodies in it the details of the radar system 
such as the transmitted waveforms and the structure of antenna array. 



In this paper we study the conditions under which this problem has a 
satisfactory solution. This is a fundamental issue of both theoretical and 
practical importance. More specifically, the analysis presented in the follow- 
ing sections addresses the following issues: 

• It is known from the theory of compressed sensing [U [8] that the matrix 
A must satisfy certain conditions in order that the solution computed 
via an appropriate convex program will indeed coincide with the de- 
sired sparsest solution (whose computation is in general an NP-hard 
problem). In our problem the characteristics of this matrix depend on 
the choice of the radar waveforms and the number and positions of the 
transmit and receive antennas. We develop the results necessary for 
understanding how the selection of the parameters of the radar system 
affects the conditions mentioned above. 

• The ability of the algorithm to correctly detect targets depends on the 
number of these targets, K, and the signal to noise ratio. We show 
that as long as the number of the targets is less than a maximal value 
-Kmax) and the signal to noise is larger than some minimal value SNR m j n , 
the targets can be correctly detected with high probability by solving 
an ^-regularized least squares problem known under the name lasso. 
Explicit formulas are presented for K m3jX and SNR min as a function 
of the number of transmit and receive antennas and the number of 
azimuth and range cells. 

The structure of the paper is as follows. Subsection 11.11 introduces no- 
tation used throughout the paper. In Section [2] we describe the problem 
formulation and the setup. We derive conditions for the recovery of targets 
in the Doppler-free case in Section El and the case of detecting targets in 
presence of Doppler is analyzed in Section HJ Our theoretical results are sup- 
ported by numerical simulations, see Section [5j We conclude in Section |6j 
Finally, some auxiliary results are collected in the appendices. 

1.1 Notation 

Let v e C n . As usual, we define ||v||i := J2k=i l v fcl an< ^ IMI2 := VYlk=i l Vfc l 2 - 
For a given matrix A we denote its fc-th column by A& and the element in 
the i-th row and fc-th column by Au w. The operator norm of A is the largest 



singular value of A and is denoted by ||A|| op , the Frobenius norm of A is 



|| A || j? = \/X)i k l A M]l 2 - The coherence of A is defined as 

/an |(Afe,A;)| 

/i(A) := max — . (1) 

k & ||A fc || 2 ||A{||2 

For x G C n , let T r denote the circulant translation operator, defined by 

T T x(Z)=x(Z-r), (2) 

where I — r is understood modulo n, and let Mj be the modulation operator 
defined by 

M /X (Z) = x(Z)e 2rf/ . (3) 

2 Problem formulation and signal model 

We refer to [2U E] for the mathematical foundations of radar and to [TS] 
for an introduction to MIMO radar. However, the reader needs only a very 
basic knowledge of the mathematical concepts underlying radar to be able 
to follow our approach. 

We consider a MIMO radar employing Nt antennas at the transmitter 
and Nr antennas at the receiver. We assume that the element spacing is 
sufficiently small so that the radar return from a given scatterer is fully 
correlated across the array. In other words, this is a coherent propagation 
scenario. 

To simplify the presentation we assume that the two arrays are co-located, 
i.e. this is a mono-static radar. The extension to the bi-static case is straight- 
forward as long as the coherency assumption holds for each array. The arrays 
are characterized by the array manifolds: a#(/3) for the receive array and 
3lt{/3) for the transmit array, where (3 = sin(#) is the direction relative to 
the array. We assume that the arrays and all the scatterers are in the same 
2-D plane. The extension to the 3-D case is straightforward and all of the 
following results hold for that case as well. 

For convenience we formulate our theorems and analysis in terms of delay 
t instead of range r. This is no loss of generality, as delay and range are 
related by r = 2r/c, with c denoting the speed of light. 



2.1 The model for the azimuth-delay domain 



The z-th transmit antenna repeatedly transmits the signal Si(t). Let Z(t; /3, r) 
be the N R x A^ noise-free received signal matrix from a unit strength target 
at direction (3 and delay r, where N t is the number of samples in time. Then 



Z(t;/3, 



a R (/?)a£(/3)S^. 



where S r is an N t x iVr matrix whose columns are the circularly delayed 
signals Si(t — r), sampled at the discrete time points t = nA t , n — 1, . . . , N t . 
If r = 0, we often write simply S instead of So- 

Assuming uniformly spaced linear arrays, the array manifolds are given 

by 

1 

p j2ird T /3 



MP) 



and 



a*CS) 



J2nd T j3(N T -l) 



j2ird R f3 



P j2Trd B P(N R -l) 



(4) 



(5) 



where dx and df> are the normalized spacings (distance divided by wave- 
length) between the elements of the transmit and receive arrays, respectively. 
The spatial characteristics of a MIMO radar are closely related to that 
of a virtual array with NtNr antennas, whose array manifold is a(/3) = 
slt{P) <8> a#(/3). It is known [11] that the following choices for the spacing of 
the transmit and receive array spacing will yield a uniformly spaced virtual 
array with half wavelength spacing: 



d R = 0.5,d T = 0.5N R ; 
d T = 0.5,d R = 0.5N T . 



(6) 



Both of these choices lead to a virtual array whose aperture is 0.5(NtN r — 1) 
wavelengths. This is the largest virtual aperture free of grating lobes. The 
choices fl§l) and ([7|) will also show up in our theoretical analysis, e.g. see 
Theorem [TJ 



Next let z(t;(3,r) = vec{Z}(£; (3, r) be the noise-free vectorized received 
signal. We set up a discrete delay-azimuth grid {(/3j,Tj)}, 1 < i < Np,l < 
j < N T , where A^ and A T denote the corresponding discretization stepsizes. 
Using vectors z(t;/3 i ,r 7 ) for all grid points (0i,Tj) we construct a complete 
response matrix A whose columns are z(t; Pi, Tj) for 1 < i < Np and 1 < j < 
N T . In other words, we have N T delay values and Np azimuth values, so that 
A is a N R N t x N T Np matrix. 

Assume that the radar illuminates a scene consisting of K scatterers 
located on K points of the (/3, Tj) grid. Let x be a sparse vector whose 
non-zero elements are the complex amplitudes of the scatterers in the scene. 
The zero elements corresponds to grid points which are not occupied by 
scatterers. We can then define the radar signal y received from this scene by 

y = Ax + v (7) 

where y is a N R N t x 1 vector, x is a N T Np x 1 sparse vector, v is a N R N t x 1 
complex Gaussian noise vector, and A is a N R N t x N T Np matrix. 

2.2 The model for the azimuth-delay-Doppler domain 

The discussion so far was for the case of a stationary radar scene and a fixed 
radar, in which case there is no Doppler shift. The extension of this signal 
model to include the Doppler effect is conceptually straightforward, but leads 
to a significant increase in the problem dimension. 

The signal model for the return from a unit strength scatterer at direction 
/3, delay r, and Doppler / (corresponding to its radial velocity with respect 
to the radar) is given by 

Z(t;/3,r,/)=a iJ (/3)a?(/3)S^ / , 

where S T j is a N t x N? matrix whose columns are the circularly delayed and 
Doppler shifted signals Si(t — r)e j27r ^'. 

As before we let z(t; j3, r, /) = vec{Z}(£; (3, r, /) be the noise-free vector- 
ized received signal. We extend the discrete delay-azimuth grid by adding a 
discretized Doppler component (with stepsize A/ and corresponding Doppler 
values / = kAf, k — 1, . . . , Nf) and obtain a uniform delay-azimuth-Doppler 
grid {{/3i,Tj,fk)}. Using vectors z(t;j3i,Tj,fk) for all discrete {/3i,Tj,fk) we 
construct a complete response matrix A whose columns are z(t; /%, Tj, fk) for 
1 < i < Np, 1 < j < N T , 1 < k < N f . 
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Assume that the radar illuminates a scene consisting of K scatterers 
located on K points of the (fl,Tj,fk) grid. Let x be a sparse vector whose 
non-zero elements are the complex amplitudes of the scatterers in the scene. 
The zero elements corresponds to grid points which are not occupied by 
scatterers. We can then define the radar signal received from this scene y by 

y = Ax + v (8) 

where y is a NnN t x 1 vector, x is a N T N/sNf x 1 sparse vector, v is a NftN t x 1 
complex Gaussian noise vector, and A is a N^N t x N T NpNf matrix. 

2.3 The target model 

We define the sign function for a vector z G C n as 

( v \z k /\z k \ if^^O, 
SSn{Zk) = \0 else. (9) 

We introduce the following generic K-sparse target model: 

• The support Ik C {1, . . . , N T Np\ of the K nonzero coefficients of x is 
selected uniformly at random. 

• The non-zero coefficients of sgn(x) form a Steinhaus sequence, i.e., 
the phases of the non-zero entries of x are random and uniformly dis- 
tributed in [0, 2ir). 

We do not impose any condition on the amplitudes of the non-zero entries 
of x. We do assume however that the targets are exactly located at the 
discretized grid points. This is certainly an idealized assumption, that is not 
satisfied in this strict sense in practice, resulting in a "gridding error". We 
refer the reader to [T6l [7] for an initial analysis of the associated perturbation 
error, and to [H] for an interesting numerical approach to deal with this issue. 

2.4 The recovery algorithm — Debiased Lasso 

A standard approach to find a sparse (and under appropriate conditions the 
sparsest) solution to a noisy system y = Ax + w is via 

min — ||Ax — yllo + A||x||i, (10) 

x 2 



which is also known as lasso [2B]. Here A > is a regularization parameter. 
In this paper we adopt the following two-step version of lasso. In the first 
step we compute an estimate I for the support of x by solving f lTOj) . In the 
second step we estimate the amplitudes of x by solving the reduced-size least 
squares problem min || A/x/ — y H2, where Aj is the submatrix of A consisting 
of the columns corresponding to the index set /, and similarly for xj. This 
is a standard way to "debias" the solution, we thus will call this approach in 
the sequel debiased lasso. 

3 Recovery of targets in the Doppler-free case 

We assume that Si(t) is a periodic, continuous-time white Gaussian noise 
signal of period-duration T seconds and bandwidth B. The transmit wave- 
forms are normalized so that the total transmit power is fixed, independent 
of the number of transmit antennas. Thus, we assume that the entries of 
Si{t) have variance -^-. It is convenient to introduce the finite-length vector 
Sj associated with Sj, via Sj(Z) := Si(lA t ), 1 = 1,..., N t , where A t = -^ and 
N t = T/A t . 

Theorem 1 Consider y = Ax + w, where A is as defined in Subsection \2.1\ 

and Wj G C7V(0,er 2 ). Choose the discretization stepsizes to be Ap = N 2 N 
and A T = ^. Let d T = 1/2, d R = N T /2 or d T = N R /2,d R = 1/2, and 
suppose that 

N t > 128, N T > v^V>, and (log(iV r iV») 3 < N t . (11) 

7/x is drawn from the generic K -sparse target model with 

K < K max := CoN ^ Nr n (12) 

- max 3jv t log(N T Np) V ; 

for some constant Cq > 0, and if 

?j? |xfc| > -^r^iogiv^, (13) 

kei V N R N t 

then the solution x of the debiased lasso computed with A = 2o^/2 log(A r T A r / g) 
obeys 

supp(x) = supp(x), (14) 

8 



with probability at least 

(l- Pl )(l-p 2 )(l-p 3 )(l-p A ), 
and 



II X I|2 ||y||2 

with probability at least 



|x-x|| 2 aJ12N t N R , . 

< I, n (15) 



;1-Pl)(l-P2)(l-P 3 )(l-P4)(l-P5), 



where 



S2 > 



Pl = e 3 + iV i 1 - CA \ 

p 2 = 26"^^ + 2(iV i? iV T )" 1 - 6(NtN fi )-\ 

(l~VTj3) 2 N t N R N t 

p 3 = e 2 , p 4 = N R N T e ^ , 

and 

p 5 = 2(N T N^r 1 (27r\og(N T N (S ) + ^(A^)- 1 ) + O((N T N )- 2l °* 2 ). 

Remark: 

(i) While the expressions for the probability of success in the above theo- 
rem are admittedly somewhat unpleasant, we point out that the indi- 
vidual terms are fairly small. Moreover, the probabilities can easily be 
made smaller by slightly increasing the constants in the assumptions 

on N t , N R , N T . 

(ii) The assumptions in ( II ip are fairly mild and easy to satisfy in practice. 

(iii) We emphasize that there is no constraint on the dynamic range of the 
target amplitudes. The lasso estimate will recover all target locations 
correctly as long as they exceed the noise level (TT3|) . regardless of the 
dynamical range between the targets. 

(iv) We note that \xk\ 2 /cr 2 is the signal-to- noise ratio for the /c-th scatterer 
at the receiver array input. The measurement vector y provides N R N t 
measurements of Xk- Therefore it is useful to define the signal-to- noise 
ratio associated with the /c-th scatterer as SNR^ = N R N t \xk\ 2 jo 2 . This 

9 



is often referred to as the output SNR because it is the effective SNR at 
the output of a matched-filter receiver. Equation ( I13p can thus be writ- 
ten as SNRfc > 200 log N T Np, However, the factor 200 is definitely way 
too conservative. As is evident from the comments following Theorem 
1.3 in [3], one can replace the factor 10 in ffT3|) by a factor (1 + e) for 
some e > 0, at the cost of a somewhat reduced probability of success 
and some slightly stronger conditions on the coherence and sparsity. 
This indicates that the SNR condition for which perfect target detec- 
tion can be achieved is 



SNR > SNR min := C log N T Np, 
where C is a constant of size 0(1). 



(16) 



(v) The condition that the target locations are assumed to be random can 
likely be removed by using a different proof technique that relies on a 
dual certificate approach (e.g. see [5]) and tools developed in [22]. We 
do not pursue this direction in this paper. 

The proof of Theorem [1] is carried out in several steps. We need two key 
estimates, one concerns a bound for the operator norm of A, the other one 
concerns a bound for the coherence of A. We start with deriving a bound 
for || A || op . 



Lemma 2 Let A be as defined in TheoremUl Then 

P(||A|| 2 „ > N t N R N T (l + log JV t )) < N}~ CNt , 
where C > is some numerical constant. 



;i7) 



Proof: There holds ||A||„ 
as block matrix 

B N R ,1 



|AA*|| op . It is convenient to consider AA* 



Bi, 



Bi, 



,N R 



B 



N R ,N R 



where the blocks {Bjy}^ jf =1 are matrices of size N t x N t . We claim that A A* 
is a block- Toeplitz matrix (i.e., B^/ = B i+1 y +1 , i — 1, . . . , A^^ — 1) and the 
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individual blocks B^y are circulant matrices. To see this, recall the structure 
of A and consider the entry Bu.^;/], i, il = 1, . . . , Nr, 1,1' = 1, . . . , N t : 

B[j,z ; j',z'] = ( AA*) fcj. i>jr] = y j y j A[ it i.^ iT ]A[i> t i'.p :T ] 

/9 r 



= EE^^E^W^* - ™A T K0^ 4 ,X> T (/3) fc ,^(/'Ai - nA T ) 

n=l fc=l fc'=l 

N T N T N T 

= ]T slrWMPJv E E a T (/3) fc ^p) fc , ]T Sfc (ZA f - nA r )s fc ,(/'A t -nA T ) 

/3 k=l fc'=l n=l 

= ^ ^^(i-O/J J^ Y, e jM ^ k - k '^ J2 s ^ lA * - nA T )s k ,(l'A t -nA T ), 

f3 fc=l fc' = l 71=1 

(18) 

where we used the delay discretization r = nA T , n = 1, . . . , N T . The block- 
Toeplitz structure, B^/ = Bf+i^+i, follows from observing that the expres- 
sion (I18p depends on the difference i — i', but not on the individual values of 
i, i'. The circulant structure of an individual block B^/ (i,i' are now fixed) 
follows readily from noting that 

N r N T 

J2 s k (lA t -nA T )s k ,(l'A t - nA r ) = ^ s k {(l+l)A t -nA T )sv((l' + l)A t - nA T ), 

n=l n=l 

since we have chosen A t = A T and since the shifts are circulant in this case. 
We will now show that the blocks B^/ are actually zero-matrices for i ^ %' . 
For convenience we introduce the notation 

N T 

G ktk ,(l, I') := Y Sk(lA t -nA T )s k ,(l'A t - nA T ), I, I 1 = 1, . . . , N t ; k, k' = 1, . . . , JV r , 

71=1 

Substituting dr = 1/2, dn = Nt/2 (the very similar calculation for dn = 
1/2, dx = Nr/2 is left to the reader) and the discretization (3 = nAp,n = 
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1, . . . , Np, with Ap = N 2 N in ( 1181) we can write 



N„Nrr__ 



— ' — 1 __ N T N T 






N T N T N R N T -1 

= EE Gfc ' fe '( / ' r ) E e^^^^e^^^F. (19) 

fe=l fc'=l n=0 

We analyze the inner summation in ffT9l) separately. 

N R N T -1 Nt-1N r -1 

n=0 ni=0 ri2=0 

ATr-1 JV t -1 




1 for all i, i' 



N T -1 

ri2=0 ni=0 

N R -1 

e -J v iN R N Te -i v >N RNTOk-k'- 

n 2 =0 



Hence 



fc=l fe'=l n 2 =0 = i f or k = k' 



B,, w] =ATt££cWGV(U') £ ef^ fc 2f^^ ( "' )] 



7V T JVfl-1 A^T 

fc=l ri2=0 fc=l 

Thus, Bjj/ = for i ^ i', and A* A is indeed a block-diagonal matrix, 
which in turn implies ||A||;L, = max, ||Bjj|| op . But due to the block- Toeplitz 
structure of A* A we have Bi i = B 2i2 = • • • = T$n r ,n r - Therefore 

l|A|| 2 p =||B 1 , 1 || op . (20) 
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To bound HB^iHop we utilize its circulant structure as well as tail bounds 
of quadratic forms. Let b be the first column of B^i, then HB^iHop = 
i/A^llblloo where b is the Fourier transform of b. From our previous compu- 
tations we have (after a change of variables) 

b(Z) = N T N R E G k , k (l, 0) = N T N R E J2 s k (nA T -lA t )s k (nA T ), 1 = 0,..., N t -1. 

fc=l fc=l n=l 

We will rewrite this expression so that we can apply Lemma H2] to bound 
||b||oo- Let Ttv 4 denote the translation operator on C^' as introduced in (J2J) 
and define the N t Nj> x N t Nx block- diagonal matrix U^ = {u\J} by 

U« := N R N T y/N t I NT ®T l Nt , for I = 0, . . . , N t - 1. (21) 

Furthermore, let z = [sf 1 , sf , . . . , s^ T ] T , then 

N T N t N T 

V / iV*b(0 = v^ T A^E( s *' T W = (z,U«z),= J] ugz iZ *'. 

fc=l i,i'=l 

and therefore 

N t -lN t N T N t N T N t -1 N t N T 

y^Mk) = -i= E £ ugM^«/* = E ^4f E ^'^ = E w# 



1 1=0 Li'=l i,i'=l v ' 



1 1=0 i,i'=l 



where we have denoted v$ := ^ ES" 1 u^e?** 1 */* 1 * for z, »' = 0, . . . , AyV T - 
1 and fc = 0, . . . , N t — 1. It follows from (12 ip and standard properties of the 
Fourier transform that the matrix V^ fc ^ := {t^, } is a block-diagonal ma- 
trix with Nt blocks of size Nt x A^, where each non-zero entry of such a 
block has absolute value N r Nt- Furthermore, a little algebra shows that 



||V( fe )|| F = y/NjNjNS, ||V( fc )|| op = N t N R N T , trace(VW) = N t N R N T , and 

iV t JV T 

J5.-Z.v7;>., I = 



AT t iV T 1 

E( E Mi'U?) = — trace(vW) = AWflWr. 



We can now apply Lemma [T2l (keeping in mind that x\ ~ CA/"(0, 7^-)) and 
obtain 



/ ( tN t 2 N^ i \ 

P(l V^b( | > iV«iV fl iV r + < exp ( - C- {-^-, _Z,}) , 



tN T t 2 N% 

t N R ly l 
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where C > is some numerical constant. 
Choosing t = N t N R NT log N t gives 

¥(\y/W t b(l)\ > N t N R N T (l + logN t )) < exp(-CN T \ogN t 



(!■ 



for I = 0, . . . , Nt — 1. Forming the union bound over the N t possibilities for 
/ gives 



N t -1 

1-CNt 

t 



P(max{| v / iV(b(/)|} > N t N R N T {l+\ogNt)) < ^T exp(-Cy/N^logN t ) = N t 



i 

1=0 

(22) 

We recall that ||B l5l || op = max; \y/Nb(l)\, and substitute (|22|) into (1201) 

to complete the proof. □ 

Next we estimate the coherence of A. Since the columns of A do not 
all have the same norm, we will proceed in two steps. First we bound the 
modulus of the inner product of any two columns of A and then use this 
result to bound the coherence of a properly normalized version of A. Since 
the columns of A depend on azimuth and delay, we index them via the 
double-index (r, j3). Thus the (r, /3)-th column of A is A T)/ g. 

Lemma 3 Let A be as defined in Theorem^ Assume that 

N t 



N T > v^ and log{N T N p ) < ^, (23) 

then 



max \(A T ^A T ^)\<3N R JNtlog(N T Np) (24) 

(t,/3)^(t> ,/3>y ' V 

with probability at least 1 - 2(N R N T y l - 6(iV r A^ i? A^ T )~ 1 . 

Proof: We assume d T — §, d R = -£- and leave the case d T = -^, d R = \ to 
the reader. We need to find an upper bound for 

max|(A Ti/3 ,A T ^)| for (r,/3) ± (r',/3'). 

It follows from the definition of z(t;/3, r) via a simple calculation that 

A r>/3 = a fl (/3) <g) (S r a T (/3)), 

from which we readily compute 

(A r)/3 ,A r ^> = (a fl (/3),a R (/3'))(S r a T (/3),S T ,a T (/3')>. (25) 
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We use the discretization (3 = nAp, (3' = n'Ap, where A^ = N 2 N , n,n' = 
1, . . . , Np, with Np = NrNt, and obtain after a standard calculation 

IMP), a*OT> = {"" I"' "', = kN - i0 ' k = °-- N -- 1 - (26) 

10 if n - n f kA' r , 

and 

JO if n-n' = kN R for A; = 1,...,JVt-1, 

(a T (/3),a T (/3)> = <> 

[^(a T (/3),a T (/3)) if n - n' = 0. 

(27) 
As a consequence of (1261) . concerning (3, (3' we only need to focus on the case 
n — n' = kN R for k — 1, . . . , iVr — 1. Moreover, since 

(S T a T (/3), S r ,a T (/3')} = (S r _ T ,a T (/3), Sa T (/3')>, for r,r' = 0, . . . ,N T - 1, 

and |(S T aT(/3), a-r(/3'))| = |(Sjv t -ra-T(/?),a-T(/3'))|, we can confine the range 
of values for r, r' to r' = 0, r = 0, . . . , N t /2. 

We split our analysis into three cases, (i) j3 ^ j3', r — 0, (ii) /3 7^ /?', r 7^ 0, 
and (iii) /? = /?', r ^ 0. 

Case (i) p ^/3',r = 0: We will first find a bound for |(a fl (/3), a i? (/3'))(a T (/3), a T (/3')>| 
and then invoke Lemma [TTlto obtain abound for \{a R ((3), a R ( / 9'))(Sa T (/3), Sa T (/9'))|. 

Based on ([26]) and (J2ZJ) , to bound \{a. R (/3),a. R {/3')){Sa. T {(3),Sa. T {(3'))\ we 
only need to consider those n, n' for which n — n' is not a multiple of N R , in 
which case sltW) and sltW) are orthogonal. We have 

|(a fl (/3),a K (/3 / ))(Sa T (/3),Sa T (/3 / ))| < JV* |(S*Sa r (/3),a T (/3')>|. (28) 

By Lemma [TT] there holds 

p(|(S*Sa T (/3),a r (/?'))| > tN t ) < 2exp ( - N t - * C J ) (29) 

for all < t < 1, where Ci = -7= and C 2 = y/8e. We choose £ = 



3 x /± \og(N T N R N T ) in ([21]) and get 

9\og(N T N R N T ) 



F(\(S*SaL T (l3),8L T {/3'))\>3y/N t log(N r N R N T )) <2exp 



d + ^v/log^iVfliVT; 
(30) 
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We claim that 

9log(N T N R N T 



^>2\og(N R N T ). (31) 

C 1 + mJhg(N T N R N T )- 



To verify this claim we first note that ( I3TJ) is equivalent to 

6Q 
/fit 

Using both assumptions in ( 123]) and the fact that 2Ci + ^§= — 9 < | we obtain 



6C I 

91ogiV T > log(iV i? iV T )(2C , 1 + —Ly/logWrNf,) - 9). 



30 



91ogiV r > logNppd + -=| - 9) > logiV^d + -j^JlogiNtNp) - 9), 
which establishes (J3"T|) . Substituting now ( |3"T|) into f )30|) gives 



P(^|(S*Sa T (/3),a T (/3 , ))| > 3^N t log(N T N R N T )j < 2exp ( - 2log(N R N T )). 

(32) 
To bound max |(A Tii g, A Tii g/)| we only have to take the union bound over 
N R Nj- different possibilities associated with j3,/3 ; , as r = r' = 0. Forming 
now the union bound, and using f )28|) . yields 



P 



(|(A Ti/ 3, A T|/9 ,)| < 3N Ry /N t log{N T N R N T j) > 1 - 2(iV R iV T )^ 1 . (33) 



Case (ii) j3 ^ /3', r ^ 0: We need to consider the case |(S r a T (/3), Sa T (/3'))| 
where /3 = nA^, /?' = n'Ap, with n — n' = kN R for A; = 1, . . . , JV^ — 1- Since 
the entries of S are i.i.d. Gaussian random variables, it follows that the entries 
of S T aT(/3) are i.i.d. CAf(0, l)-distributed, and similar for Sa r (/3'). More- 
over, the fact that (a^/^), sltW)) = implies that S T ar(/3) and Sslt((3') 
are independent. Consequently, the entries of Z~2i2o (ST&T(/3))i(Sa.T(/3'))i 
are jointly independent. Therefore, we can apply Lemma [14] with t = 
3^N t \og(N T N R N T ), form the union bound over the N T N R N T possibilities 
associated with r (we do not take advantage of the fact we actually have 
only N T — 1 and not N T possibilities for r) and /3, /3' (here, we take again into 
account property (I2"5"|) ). and eventually obtain 

p(|(A Ti/3 , A T , J3 ,)\ < 3N RA /N t log(N T N R N T )^ > 1 - 2(N T N R N T )-\ (34) 
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Case (iii) f3 = (3', r ^ 0: We need to find an upper bound for | (S T a r (/3), Sa r (/3)) | 
where t = 1, . . . ,N t — 1. Since Since each of the entries of S t sltW) and of 
Sa^(/3) is a sum of Nt i.i.d. Gaussian random variables of variance 1/Nt, we 
can write 

N t -1 

|<S r a T (/3),Sa T (/?)}| = | J>_ r ^|, (35) 

«=o 

where gi ~ A/"(0, 1). Note that the terms gi- T gi in this sum are no longer 
all jointly independent. But similar to the proof of Theorem 5.1 in [20] we 
observe that for any r / we can split the index set 0, . . . ,N t — 1 into 
two subsets A*, Al C {0,...,N t - 1}, each of size N t /2, such that the N t /2 
variables g{l — r)g(l) are jointly independent for / e A*, and analogous for 
A 2 . (For convenience we assume here that Nt is even, but with a negligible 
modification the argument also applies for odd N t .) In other words, each 
of the sums J2i eAr g(l — r )5 , (0> r = 1, 2, contains only jointly independent 
terms. Hence we can apply Lemma HH and obtain 

for all t > 0. Choosing t = l^N t \og(N t N R N T ) gives 

P (I E HI - r)M\ > I V X logWAW ) < 2 exp ( - - |f '; S( , Af :^ ) jv , 

v ^ 2 +3 v /A r ilog(A r t A^ i? Ar Tj 



< 2 exp ( 



9\og(N t N R N T ) 



2 4 12 / lo S(NtN R N T ) 



(36) 



Condition ([23]) implies that 12 J log(JV ^ flJVr) < |, hence the estimate in ()36 
becomes 

P (l X>(*-tM0| > ^logf^iVrlv^) <2exp( 



9 log(N t N R N T ) 



2 4- - 
zeAr ~ z + 2 



2exp(-21og(iVyV fl iV T )) 

2(iV i iV i? iV T )- 2 . (37) 
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Using equation (I35|) . inequality ( 13TJ) . and the pigeonhole principle, we obtain 
p(|(S T a r (/3),Sa r (/3))| > 3y/N t log(N t N R N T )) < 4(iV i iV R iV T )- 2 , 



Combining this estimate with (|25p yields 

p(|(A Ti/3 , A rV )| > 3N RV /N t \og(N T N R N T )) < A(N t N R N T )- 2 



We apply the union bound over the ^NtN r different possibilities and arrive 

at 

p(max|(A Tj/3 , A r ,^)| < 3N Ry / N t log(N T N R N T )) > 1 - A(N t N R N T )-\ 

(38) 
where the maximum is taken over all r, r', (3, (3' with r ^ r' . 

An inspection of the bounds fl33l . flMl) . and fl38|) establishes (pHj) . which 
is what we wanted to prove. □ 

The key to proving Theorem [1] is to combine Lemma [2] and Lemma [3] 
with Theorem [15j The latter theorem requires the matrix to have columns 
of unit-norm, whereas the columns of our matrix A have all different norms 
(although the norms concentrate nicely around y/NtNjtNr). Thus instead of 
Ax = y we now consider 

Az = y, where A := AD -1 and z := Dx. (39) 

Here D is the N T Np x N T Np diagonal matrix defined by 

D(r,/3),(r,/3) = ||A r ^|| 2 . (40) 

In the noise- free case we can easily recover x from z via x = D _1 z. In the 
noisy case we will utilize the fact that for proper choices of A the associated 
lasso solutions of (ITU]) and (1501) . respectively, have the same support, see also 
the proof of Theorem [TJ 

The following lemma gives a bound for /i(A) and || A|| op in terms of the 
corresponding bounds for A. 

Lemma 4 Let A = AD -1 , where the D the diagonal matrix is defined 
by ( 140]) . Under the conditions of Theorem d there holds 

p(||A|| 2 op <3(l + logJV t )) >1-Pi, (41) 



where pi = e Nt 2 — N t T , and 



p(//(A) < 6 J±- log(N T N R N T )) >1-P2, (42) 



where p 2 = 2e^ M ^ L - 2(N R N T )~ l - e^A^A^)- 1 . 
Proof: We have 

l| A H2 



lo P — ™„ v . II A .112 

Recall that 



A IP < """ op (A-i) 

A op — II a 112' V^°/ 

max T ^ A Tj/3 2 



A Ti/J = a fl C9) ® (S r a T (/3)), (44) 

hence ||A T) ^||| = ||a ie (/3)|||||S T a r (/3)|||. Since the entries (S r a r (/3)) fc ~ C7V(0, JV T ) 
we have E||S r a^(/3)|| = y/Nt, and thus by Lemma [9] 

p( v / ^t-||S T a T (/3)|| 2 >t) < e -4, (45) 

for all £ > 0, hence 

Choosing £ = ( 1 — yl/3) \/iVt in ( 1461) and forming the union bound only over 
the NrNt different possibilities associated with /3 (note that ||S T -a^r(/3) H2 = 
1 1 Say (y9) || 2 for all r), gives 

Vmax||A Ti/3 ||^ N t N R J 

The diligent reader may convince herself that the probability in (14T|) is indeed 
close to one under the condition (ITT]) . We insert ( 1T7|) and (J4"T|) into (143|) and 
obtain 

p(||A||2p < 37V T (l + logiV 4 )) > 1 - e- w-f 75 * _ ^i-cVSF (4g) 

which proves (|4T|) . 

To establish (1421) we first note that 



^( A ) ^ fr ^, { D (i) ) (r, / 9)l(A*A) (T ^ ) , (T ,^ ) |D (T ^, ):(T/ ^ ) }, (49) 



(r,/3)^(r',/3') 
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where D, *gw „■> = ||A T/3 || 2 *. Using Lemma M and ( H3|) we compute 
P(||A T)/3 || 2 > v/iVtiVn - y/N^t] > 1 - e~T. 



Therefore 

™/ 1 1 \ * 2 

and thus 

pf|A*A)f T m r r > fl/J < ; ; ; : |(A*A)( r m f T / «a| > ) > 1 - 2e _ £, 

By choosing £ = (1 — l/y/2)-\/Nt, we can write 1 J501 as 

2 ,, . „ . s ,\ _ N t (V2-l) 2 

4 



P(|A*A) (T)/3))(r , i/30 | < ^|(A*A) (TiWi( ^|j > 1 -2e 
Finally, plugging ( 150]) into (j49|) and using ( 124)) we arrive at 



/ /I \ iVWx/2-1) 2 

P(^(A) < 6J — log(N T N R N T )) > l-2e J -2(iV K 7V T )" 1 -6(7V t 7V fl 7VT)- 1 . 

□ 
We are now ready to prove Theorem [TJ Among others it hinges on a 
(complex version of a) theorem by Candes and Plan [3j, which is stated in 
Appendix B. 

Proof of Theorem [1} We first point out that the assumptions of Theo- 
rem [1] imply that the conditions of Lemma [2] and Lemma [3] are fulfilled. For 
Lemma [2] this is obvious. Concerning Lemma [31 an easy calculation shows 
that the conditions {log{N T N R N T )) 3 < N t and N t > 128 indeed yield that 
\og(N t N R N T ) < f . 

Note that the solution x of (ITU]) and the solution z of the following lasso 
problem 

min-||AD~ 1 z-y|| 2 ! + A||z|| l7 with A = 2ay/2log(N T N R N T ), (50) 
z 2 

satisfy supp(x) = supp(D _1 z). 

We will first establish the claims in Theorem [1] for the system Az = y 
in (15§j) where A = AD 1 , z = Dx and then switch back to Ax = y. 
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We verify first condition (1771) . Property (1131) and the fact that z = Dx 
imply that 



10||A T ,fl|| 2 



\z k \ > ^j^§AaJ2log(N T N p ), for (r, /3) e S. (51) 

Using Lemma M we get that 

PfjIA^H > y/N R N t - t) > 1 - e~T. (52) 



Choosing £ = ttt \JN R N t and combining f )52|) with f )5T]) gives 



N > 8aJ2log(N T N p ), for fc € 5, 

N R N t 

with probability at least 1 — e~ 25 , thus establishing condition ( 1771) . 

Note that A has unit-norm columns as required by Theorem [151 It re- 
mains to verify condition f J75|) . Using the assumption (Hip , and the coherence 
bound fH2|) we compute 

.2, x^o*l ,-, ^ *,*, w o« log(JV T JV fl JV T ) _ 36 



/AA) < 36— \og(N T N R N T ) < 36 



^ log 3 (N T N R N T ) " log 2 (N T N R N T y 

which holds with probability as in ( 142|) . and thus the coherence property (175 p 
is fulfilled. 

Furthermore, using (|4"T|) we see that condition ( 1T2|) implies 

K < c iV r iV fi < c iV r Ar R 



3(lH-logA^)log(^ T ^^ T ) - ||A||2 p log(Ar T A^ R iV r ) 

with probability as stated in (141 1) . Thus assumption ( 1761) of Theorem [T5l is 
also fulfilled (with high probability) and we obtain that 

supp(z) = supp(z). (53) 

We note that the relation supp(x) = supp(x) holds with the same probabil- 
ity as the relation supp(z) = supp(z) (see equation (j5"3"|) ). since supp(z) = 
supp(x) and multiplication by an invertible diagonal matrix does not change 
the support of a vector. This establishes ( fT4l) with the corresponding prob- 
ability. 
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As a consequence of (1791) we have the following error bound 

||Z-Z|| 2 < 3(Ty/N T N(, 



||z|| 2 ||y||2 

which holds with probability at least 

(l- Pl )(l-p 2 )(l-e~^)(l-2(N T Npy\2n\og(N T Np)+K(N T Npy 1 )-0((N T N^ 

where the probabilities pi,p2 are as in Lemma HI Using the fact that z = Dx, 
we compute 

1 ||x-x|| 2 ||D(x-x)|| 2 ||z-z|| 2 



k(D) ||x|| 2 l|Dx|| 2 ||z|| 2 

or, equivalently, 

l|x-x|| 2 ||z-z|| 2 

ll x ll ^( p ) ll z ll • ( 55 ) 

ll X l|2 || z ||2 

Proceeding along the lines of (145|) -( 147|) . we estimate 

N t (i--\/iT3) 2 

P(«(D) < 2) > 1 - iV fl iV r e 2 • (56) 

The bound ffT5|) follows now from combining (1511) with (155]) and fl56l) . □ 

4 Recovery of targets in the Doppler case 

In this section we analyze the case of moving targets/antennas, as described 
in 12.21 As in the stationary setting, we assume that Si(t) is a periodic, 
continuous-time white Gaussian noise signal of period-duration T seconds 
and bandwidth B. The transmit waveforms are normalized so that the total 
transmit power is fixed, independent of the number of transmit antennas. 
Thus, we assume that the entries of Si(t) have variance -£-. 

Theorem 5 Consider y = Ax + w, where A is as defined in Subsection \2.2\ 
and Wj G CA/"(0,cr 2 ). Choose the discretization stepsizes to be A@ = N 2 N , 
A T = ^ and A f = ±. Let d T = 1/2, d R = N T /2 or d T = N R /2,d R = 1/2, 
and suppose that 



N t > 128, max{A T , N f , ^/N T , N f } > y/Wp, and (log(N T Np)) < N t 
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Ifx is drawn from the generic K -sparse target model with 

._ cpN T N f N R 
" max - Qlog(N T N f N,) 

for some constant c > 0, and if 

?j? |xfe| > "T^W 2 log N T N f N p , 
fee J V N R N t 



then the solution x of the debiased lasso computed with X = 2<ta/2 \og(N T NfNp 
obeys 

supp(x) = supp(x), 

with probability at least 

(l- Pl )(l-p 2 )(l-p 3 )(l-p 4 ), 

and 

||x-x|| 2 < ay/l2N t N R 



Il x ll2 ||y||2 

wi& probability at least 

(l- Pl )(l-p 2 )(l-p 3 )(l-p 4 )(l-p 5 ), 

where _ 

Pl = e s + iV Te -(V 3 /2-V2)7v t5 

(l-y^73) 2 iVt N R N t 

p 3 = N R N T e 2 , p 4 = e 25 , 

and 

p 5 = 2(iV T iV / 3)- 1 (27rlog(iV r iV /3 ) + SW^)" 1 ) + 0((N T N p )- 2l< ^ 2 ). 

Proof: The proof is very similar to that of Theorem [TJ Below we will 
establish the analogs of the key steps, Lemma |2l Lemma [31 and Lemma HI 
and leave the rest to the reader. □ 

Lemma 6 Let A be as defined in Theorem^ Then 

P(j|A|| 2 p < 2N t N f N R NT) > 1 - N T e- Nt ^-^\ (57) 
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N t (V2-l) 2 



Proof: We proceed as in the proof of Lemma [2J There holds ||A||^ 

|| A A* || op . It is convenient to consider A A* as block matrix 



Bii B 



B N R ,l 



1.2 



Bi, 



iV fi 



B 



N R ,N R 



where the blocks {B^j/}^ * =1 are matrices of size N t x N t . We claim that A A* 
is a block- Toeplitz matrix (i.e., B^/ = Bj+i^+i, i — 1, . . . , Nr — 1) and the 
individual blocks B^j/ are circulant matrices. To see this, recall the structure 
of A and consider the entry Br^/ m, i,i' = 1, . . . , Nr, 1,1' = 1, . . . , N t : 



B[i,^y] = (AA*) [iiW /j = ^^^A[ ii/;r j i/3] A 



[»'/;t,/,j 



7V T 7V T 



AT/ 



V^ e J2wd R (i~i')l3 y^ y^ ^^{k-k^Q^ (^ y\ Sp ^(J-OAtmA, 



fc=l fc'=l 



m=l 



N R N T -1 

E ■ 

n=0 



J2TT(i-i') 



.,, nJV< 



7V T 7V T 



fljVr EE 



«*(*-*) ^ 



-■G k>k ,{l,l')N f 8, 



i-v 



fc=i fc'=i 



A' T 



N T N R N f J] Kf^-^i-J' 



(58) 
(59) 



fc=i 



2B 



where we have used in (1581) that Nf = 
Nf5i-i'. Thus 



2BT, whence ^Hi e^ 2 ^ l -^ mAtA f 



N T 



AA* = (N T N R N f J2\\ s k\\ 2 )I, 



(60) 



fc=i 



i.e., AA* is just a scaled identity matrix. Since s^ is a Gaussian random 
vector with Sk(j) ~ CA/"(0, 1), Lemma |9] yields 



P(||s fc ||£ - (E||s fc || 2 ) 2 > t(t + 2E||s fc || 2 )) < e 



t 2 /2 



(61) 



where we note that E||sfc|| 2 = yj^- We choose t = (\/2 — l)\fW t , and obtain, 
after forming the union bound over k — 1, . . . , N-t — 1, 



AT T 



(62) 



fe=i 
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The bound (j57p now follows from (1601) . 

Next we establish a coherence bound for A. 

Lemma 7 Let A be as defined in the Doppler case. Assume that 

N 



N>y/Nt,\og(NNp)<-^, (63) 



where N := max{iV T , N f , ^N T N f }. Th 



en 



max (A Tj>/3 , A T ,j,^}\ < 3N R JN t log(N T N f Np) 

with probability at least 1 - 2(N R N T )~ 1 - 2(N T N R N T y 1 - 2(N f N R N T )~ 1 - 
6(N T N f N R N T )- 1 . 

Proof: 

We have that A. T ,f,p — a /?(/3)® (S r ,/a T (/3)). A standard calculation shows 
that 

|(S rJ a T (/3),S r , f a T (/n>l = |(S r _ T ,, / _ / ,a T (/3),a T (/? / )}| (64) 

for r, t' = 0, . . . , N T — 1, /, /' = 0, . . . , Nf — 1, thus we only need to consider 
\{S t jb.t(I3), Sa.T(/3'))\. As in the proof of Lemma [3] we distinguish several 
cases. 

Case (a) /3 ^ (3', r = 0, / = 0: In this case we are concerned with 
|(Sa r (/3), Sa T ( / 9'))|, which is the same as Case (i) of Lemma El except that 
in the present case we have a bit more flexibility in choosing t in the anal- 
ogous version of (1291) . Here we can choose t = 3^/^-\og(NN R N T ), where 



N = max{iV r , Nf, ^N T Nf}. Proceeding then as in the proof of Case (i) of 
Lemma |3] we obtain 

p(|(A Tji/3 , A r , /i/? ,)| < 3N R y/N t log(N T N R N T )) > 1 - 2(N R N T )-\ (65) 

Case (b) f3 ^ (3', r ^ 0, / = 0: This is exactly the same as Case (ii) of 
Lemma [31 We obtain 

p(|(A Tji/3 , A T ,j^,)\ < 3N R y/N t log{N T N R N T j) > 1 - 2(N T N R N T )-\ (66) 



Case (c) ^ j3', r = 0, / ^ 0: It is well known that (T T x) A = M_ r x. 
Hence, by Parseval's theorem, (T T x, y) = (M_ T x, y). Since the normal 
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distribution is invariant under Fourier transform, this case is therefore already 
covered by Case (b), and we leave the details to the reader. We get 

p(|(A t , /]/3 , A T)/ ^>| < 3N R ^N t log{N f N R N T j) >l-2(N f N R N T )-\ (67) 

Case (d) /3 ^ /3', r ^ 0, / ^ 0: This is similar to Case (ii) of Lemma [3j The 
only difference is that we have N t NfN R Nj< different possibilities to consider 
when forming the union bound (the additional factor Nf is of course due to 
frequency shifts associated with the Doppler effect). Thus in this case the 
bound reads 



P(|(A T)/ii3) A tW )| < 3N R ^N t \og(N T N f N R N T )^ > 1 - 2(N T N f N R N T )- 1 . 

(68) 

Case (e) (3 = 0': We need to bound |(T r M / Sa T (/3), Sa T (/3))|, where we 
recall that Sa-r(/3) is a Gaussian random vector with variance Nt- (We 
note that a related case is covered by Theorem 5.1 in [20], which considers 
{T T Mfh, h), where h is a Steinhaus sequence.) This case is essentially taken 
care off by Case (iii) of Lemma [31 by noting that a Gaussian random vector 
of variance a remains Gaussian (with the same a) when pointwise multiplied 
by a fixed vector with entries from the torus. The only difference is that, 
as in Case (d) above, we have N t NfN R N T different possibilities to consider 
when forming the union bound. Hence, the bound in this case becomes 

p(max|<A Ti/) £, A tW }| < 3N R ^/ N t hg(N T N f N R N T )) > l-4(iV 4 iV / iV /? iV T )- 1 . 

(69) 

Lemma 8 Let A = AD" 1 , where the entries of the N T NfNj3 x N T NfN/3 
diagonal matrix are given by ~D(r,f,0),(T,f,i3) — ||A Ti/ g||2. Under the conditions 
of Theorem U\ there holds 



p(||A|| 2 op <6iV T )>l-p 1 , (70) 



where 
and 



(1-073)^, 



Pi = e 2 + j\ T e 



+ JVre-tv 3 / 2 -^)^ 



p(/i(A) < Qj^\og(N T N f N R N T )^ >l-p 2 , (71) 



N t 
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where 

111 i iV+(\/2-l) 2 

p 2 = 2(N R N T y 1 +2(N T N R N T y 1 +2(N f N R N T y 1 +6(N T N f N R N T y 1 +2e 3 

Proof: Since the proof of this lemma follows closely that of Lemma HI we 
omit it. □ 



5 Numerical Experiments 

Next we illustrate the performance of the compressive MIMO radar developed 
in previous sections. We consider a Doppler-free scenario. The following 
parameters are used in this example: Nt = 8 transmit antennas, N R = 8 
receive antennas, Nt = 64 samples, N r = N t range values. 

At each experiment K scatterers of unit amplitude are placed randomly 
on the range/azimuth grid, i.e the vector x has K unit entries at random 
locations along the vector. White Gaussian noise is added to the composite 
data vector Ax with variance a 2 determined to as to produce the specified 
output signal-to-noise ratio (see also item (iv) of the Remark after Theo- 
rem [TJ. The lasso solution x is calculated with A as specified in Theorem [TJ 
The numerical algorithm to solve ffTUl) was implemented in Matlab using 
TFOCS [1]. The experiment is repeated 100 times using independent noise 
realizations. 

The probabilities of detection Pd and false alarm Pf a are computed as 
follows. The values of the estimated vector x corresponding to the true scat- 
terer locations are compared to a threshold. Detection is declared whenever 
a value exceeds the threshold. The probability of detection is defined as the 
number of detections divided by the total number of scatterers K. Next the 
values of the estimated vector x corresponding to locations not containing 
scatterers are compared to a threshold. A false alarm is declared whenever 
one of these values exceeds the threshold. The probability of false alarm is 
defined as the number of false alarms divided by the total number of scatter- 
ers K. The probabilities of detection and false alarm are averaged over the 
100 repetitions of the experiment. 

The probabilities are re-computed for a range of values of the threshold to 
produce the so-called Receiver Operating Characteristics (ROC) [HI [281 [2Sj 
- the graph of Pd vs. Pf a . As the threshold decreases, the probability of 
detection increases and so does the probability of false alarm. In practice the 



27 



threshold is usually adjusted to as to achieve a specified probability of false 
alarm. 

Figures [Q [21 [3] and H] depict the ROC for different values of the output 
signal to noise ratio. We note that the probability of detection increases as 
the SNR increases and decreases as K, the number of scatterers increases. 



Pd vs Pfa, SNR = 1 5, Kmax = 20, N T = 8, N R = 8, N t = 64 
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Figure 1: Probability of detection vs. probability of false alarm for SNR 
15 dB, and three values of K: i^ max /2, K max , 2i^ max . 



6 Conclusion 

Techniques from compressive sensing and sparse approximation make it pos- 
sible to exploit the sparseness of radar scenes to potentially improve system 
performance of MIMO radar. In this paper we have derived a mathemat- 
ical framework that yields explicit conditions for the radar waveforms and 
the transmit and receive arrays so that the radar sensing matrix has small 
coherence and robust sparse recovery in the presence of noise becomes pos- 
sible. Our approach relies on a deterministic (and very specific) positioning 
of transmit and receive antennas and random waveforms. It seems plausible 
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PdvsPfa, SNR = 20, Kmax = 20, N_T = 8, N_R = 8, N_t = 64 



1 




0.9 




0.8 




0.7 




0.6 

2 0.5 


^ 

* 
* 
* 
t 

* 


0.4 


t 


0.3 
0.2 


• 

» V 

1 ' 


0.1 


/ 

-i 



■K=10 
■K=20 
■K=40 



0.02 0.04 0.06 0.08 0.1 0.12 0.14 0. 

Pfa 



16 



Figure 2: Probability of detection vs. probability of false alarm for SNR = 
20 dB, and three values of K: K max /2, K max , 2K max . 

that results similar to the ones derived in this paper can be established for 
the case where the antenna locations are chosen at random and the trans- 
mission signals are deterministic. This would be of interest, since one could 
then potentially take advantage of specific properties of recently designed 
deterministic radar waveforms such as in [21 [19] . 



Appendix A 

In this appendix we collect some auxiliary results. 

Lemma 9 [29^ Proposition 34] Let x G C n be a vector with x^ ~ CJ\f(0, a 2 ), 
then for every t > one has 



P llxl 



Ellxllo >t < e 



(72) 



The following lemma, which relates moments and tails, can be found e.g. 
in [221 Proposition 6.5]. 
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PdvsPfa, SNR = 25, Kmax = 20, N_T = 8, N_R = 8, N_t = 64 
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Figure 3: Probability of detection vs. probability of false alarm for SNR 
25 dB, and three values of K: i^ max /2, K max , 2J<" max . 



Lemma 10 Suppose Z is a random variable satisfying 

(E|Z| p ) 1/p < a(5 l,p p lh for all p > p 
for some constants a,/3,j,po > 0. Then 

P(|Z| > e lh au) < (3e~ u " h 

for all u > p . 

The following lemma is a rescaled version of Lemma 3.1 in [23] . 

Lemma 11 Let A e c nxm fr e a Gaussian random matrix with A^j ~ CJ\f(0, a 2 ) . 
Then for all x, y G C m with ||x|| 2 = ||y|| 2 = ^fm and all t > 



P 



{| — -(Ax, Ay) - (x,y}| > tm\ <2exp ( - n— - 



f 



at 



with C\ = -4= and C 2 = \/Se. 

V D7T 
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PdvsPfa, SNR = 30, Kmax = 20, N_T = 8, N_R = 8, N_t = 64 
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Figure 4: Probability of detection vs. probability of false alarm for SNR = 
30 dB, and three values of K: i^ max /2, K max , 2K max . 

The next lemma is a slight generalization of a result by Hanson and 
Wright on tail bounds for quadratic forms |12j . 



Lemma 12 Let M = {m»j}£j- =1 be a normal matrix and let X iy i = 0, 

1 be independent, CM '(0,1) -distributed random variables. Denote 

n-l 

s n = 2_^ m ijXiXj. 

i,j=0 

Then for allt > 

F(s n > t + ES n ) <exp(-C min{ * , J 

II II U P II \\ r 

where C is a numerical constant independent of M and n. 



n- 



))■ 



Proof: The proof follows essentially the same steps as the proof of the 
main theorem in [12], which considers the case where M is hermitian and 
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the Xi are real-valued. Extending the Xj to the complex case is trivial, thus 
the only modification that needs to be addressed is the extension of M from 
the hermitian to the normal case. But Lemma 5 in [12] holds for normal 
matrices as well, therefore the lemma follows. □ 

For convenience we state the following version of Bernstein's inequality, 
which will be used in the proof of Lemma [T4l 

Theorem 13 (See e.g. | I27| ) Let Xi, . . . ,X n be independent random vari- 
ables with zero mean such that 

^\Xi\ p < -p\K p - 2 v h for all i = 1,. . . ,n;p GN,p>2, 

for some constants K > and Vi > 0, i — 1, . . . , n. Then, for all t > 

P(|^A,|>«)<2« P (- Srj - Js ), (73) 

where v := J^" =1 Vi. 

We also need the following deviation inequality for unbounded random 
variables. It is a complex-valued and slightly sharpened version of Lemma 6 
in [13], the better constant will be useful when we apply Lemma [TH in the 
proof of Lemma [3j 

Lemma 14 Let Xi and Yi, i = 1, . . . , n, be sequences of i.i.d. complex Gaus- 
sian random variables with variance a. Then, 

P(|£x^,|>t)<2exp(- J ). (74) 

Proof: In order to apply Bernstein's inequality, we need to compute the 
moments E|XjYj| p . Since Xi and Yi are independent, there holds 

EQXXl 9 ) = E(\Xi\*)E{\Yi\ p ) = (E(|X,H) 2 . 
The moments of Xi are well-known: 

E\X t \ 2p = p\o- 2p , 
hence 

(E|X,n 2 = (2p!) V P ) 2 < \^pW 2 ? p < l -{2p)\{° 2 Y p - 2 ^f- 

We apply Bernstein's inequality f!73|) with K = a 2 and v t = ^j-, i — 1, • • • , n 
and obtain (1741) . □ 
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Appendix B 

We consider a general linear system of equations \I/x = y, where \I/ G C nxm , 
x G C m and n < m. We introduce the following generic i^-sparse model: 

• The support I C {1, . . . ,m} of the K nonzero coefficients of x is se- 
lected uniformly at random. 

• The non-zero entries of sgn(x) form a Steinhaus sequence, i.e., sgn(x&) : = 
Xfc/|xfc|,fc G /, is a complex random variable that is uniformly dis- 
tributed on the unit circle. 

The following theorem is a slightly extended version of Theorem 1.3 in [3]. 

Theorem 15 Given y = \I/x + w ; where ^ has all unit-i-2-norm columns, 
x is drawn from the generic K -sparse model and Wj ~ CA/"(0,<r 2 ). Assume 
that 

M*) < r^-, (75) 

logm 

where Cq > is a constant independent of n,m. Furthermore, suppose 

K < ,,^7 (76) 

II* II % logm 

for some constant Cq > and that 



min |xfc| > 8a a/2 logm. (77) 

k£l 



Then the solution x to the debiased lasso computed with X = 2<j\/21ogm 
obeys 

supp(x) = supp(x), (78) 

and 



II X I|2 ||y||2 

with probability at least 



x-x|| 2 ^oV3n (7g) 



1 - 2m.- 1 (27rlogm + Km- 1 ) - C(m" 21og2 ). (80) 
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Proof: The paper [3] treats only the real-values case. However it is not 
difficult to see that the results by Candes and Plan can be extended to the 
complex setting if their definition of the sign-function is replaced by ([9]) and 
consequently their generic sparse model is replaced by the generic sparsity 
model introduced in the beginning of this appendix. The proofs of the theo- 
rems in [3] can then be easily adapted to the complex case via some straight- 
forward modifications, such as replacing in many steps (-, •) by its real part, 
Re(-, •) and replacing certain scalar quantities by its conjugate analogs. To 
give a concrete example of such a modification, consider (in the notation 
of [3j) the inequality right before eq.(3.10) in [3J, 

\Pi\ = \Pi + hi\> |/3 4 |+sgn(A)^- 
This inequality needs to be replaced by its complex counterpart 

|A| = |A + hi\ > |A| + Re(sgn(/3 4 )7^). 

By carrying out these easy modifications (the details of which are left to 
the reader) we can readily establish (I78p analogous to (1.11) of Theorem 1.3 
in[3J. 

Once we have recovered the support of x, call it /, we can solve for the co- 
efficients of x by solving the standard least squares problem min || A/X/ — y || 2 , 
where A/ is tbe submatrix of A whose columns correspond to the support 
set /, and similarly for x/. Statement ( 1791) follows by noting that the proof of 
Theorem 3.2 in [3] yields as side result that with high probability the eigen- 
values of any submatrix A} A/ with |/| < K are contained in the interval 
[1/2,3/2], which of course implies that n(Aj) < a/3. The statement follows 
now by substituting this bound into the standard error bound, eq. (5.8.11) 
in [UJ. □ 
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